Why might we be interested in the area under a graph? Consider an object moving in a straight line with constant velocity \(v\) (assumed positive). The distance traveled over a time interval \([t_1,t_2]\) is equal to \(v\Delta t\) where \(\Delta t = (t_2 - t_1)\) is the time elapsed. This is the well-known formula
\[\boxed{\bbox[#FAF8ED, 5pt]{ \text{Distance traveled} = \overbrace{\text{velocity}\times\text{time elapsed}}^{v\Delta t} }}\tag{1}\]
Because \(v\) is constant, the graph of velocity is a horizontal line (Figure 5.1) and \(v\Delta t\) is equal to the area of the rectangular region under the graph of velocity over \([t_1,t_2]\). So we can write Eq. (1) as \[ \boxed{ \bbox[#FAF8ED,5pt]{ \text{Distance traveled}=\text{area under the graph of velocity over }[t_1,t_2]}}\tag{2} \]
There is, however, an important difference between these two equations: Eq. (1) makes sense only if velocity \(v\) is constant whereas Eq. (2) is correct even if the velocity changes with time (we will prove this in Section 5.5). Thus, the advantage of expressing distance traveled as an area is that it enables us to deal with much more general types of motion.
To see why Eq. (2) might be true in general, let’s consider the case where velocity changes over time but is constant on intervals. In other words, we assume that the object’s velocity changes abruptly from one interval to the next as in Figure 5.2. The distance traveled over each interval is equal to the area of the rectangle above that interval, so the total distance traveled is the sum of the areas of the rectangles. In Figure 5.2,
287
\[ \text{Distance traveled over }[0,8] = \underbrace{10+15+30+10}_{\text{Sum of areas of rectangles}} = 65\text{ m} \]
Our strategy when velocity changes continuously ( Figure 5.3) is to approximate the area under the graph by sums of areas of rectangles and then pass to a limit. This idea leads to the concept of an integral.
Our goal is to compute the area under the graph of a function \(f(x)\). In this section, we assume that \(f(x)\) is continuous and positive, so that the graph of \(f(x)\) lies above the \(x\)-axis (Figure 5.4). The first step is to approximate the area using rectangles.
Recall the two-step procedure for finding the slope of the tangent line (the derivative): First approximate the slope using secant lines and then compute the limit of these approximations. In integral calculus, there are also two steps:
To begin, choose a whole number \(N\) and divide \([a,b]\) into \(N\) subintervals of equal width, as in Figure 5.4(A). The full interval \([a,b]\) has width \(b-a\), so each subinterval has width \(\Delta x=\tfrac{b-a}{N}\). The right endpoints of the subintervals are
\[\boxed{\bbox[#FAF8ED,5pt]{a+\Delta x, a+2\Delta x,\ldots,a+(N-1)\Delta x,a+N\Delta x}}\]
Note that the last right endpoint is \(b\) because \(a+N\Delta x = a + N (\tfrac{b-a}{N}) = b\). Next, as in Figure 5.4(B), construct, above each subinterval, a rectangle whose height is the value of \(f(x)\) at the right endpoint of the subinterval.
The sum of the areas of these rectangles provides an approximation to the area under the graph. The first rectangle has base \(\Delta x\) and height \(f(a+\Delta x)\), so its area is \(f(a+\Delta x)\Delta x\). Similarly, the second rectangle has height \(f(a+2\Delta x)\) and area \(f(a+2\Delta x)\Delta x\), etc. The sum of the areas of the rectangles is denoted \(R_N\) and is called the \(N^\text{th}\) right-endpoint approximation:
288
\[ R_N = f(a+\Delta x)\Delta x + f(a+2\Delta x)\Delta x+\cdots + f(a+N\Delta x)\Delta x \]
To summarize,
\(a = \text{left endpoint of the interval }[a,b]\)
\(b = \text{right endpoint of the interval }[a,b]\)
\(N = \text{number of subintervals in }[a,b]\)
\(\Delta x = \dfrac{b-a}{N}\)
Factoring out \(\Delta x\), we obtain the formula
\[ \boxed{\bbox[#FAF8ED,5pt]{R_N = \Delta x \left( f(a+\Delta x) + f(a+2\Delta x)+\cdots + f(a+N\Delta x)\right)} } \]
In words: \(R_N\) is equal to \(\Delta x\) times the sum of the function values at the right endpoints of the subintervals.
Calculate \(R_4\) and \(R_6\) for \(f(x) = x^2\) on the interval \([1, 3]\).
Solution
Step 1. Determine \(\Delta x\) and the right endpoints.
To calculate \(R_4\), divide \([1, 3]\) into four subintervals of width \(\Delta x = \tfrac{3-1}{4}=\tfrac{1}{2}\).The right endpoints are the numbers \(a + j\Delta x = 1 + \tfrac{j}{2}\) for \(j = 1, 2, 3, 4\). They are spaced at intervals of \(\tfrac12\) beginning at \(\tfrac32\), so, as we see in Figure 5.5 (move the slider to display \(R_4\)), the right endpoints are \(\tfrac32,\tfrac42,\tfrac52,\tfrac62\).
Step 2. Calculate \(\Delta x\) times the sum of function values.
\(R_4\) is \(\Delta x\) times the sum of the function values at the right endpoints:
\[ \begin{aligned} R_4 & =\dfrac{1}{2} \left( f\left(\dfrac32\right) + f\left(\dfrac42\right) + f\left(\dfrac52\right) + f\left(\dfrac62\right) \right) \\ & =\dfrac{1}{2} \left( \left(\dfrac32\right)^2 + \left(\dfrac42\right)^2 + \left(\dfrac52\right)^2 + \left(\dfrac62\right)^2 \right) = \dfrac{43}{4} = 10.75 \end{aligned} \]
\(R_6\) is similar: \(\Delta x = \tfrac{3-1}{6} = \tfrac13\), and the right endpoints are spaced at intervals of \(\tfrac13\) beginning at \(\tfrac43\) and ending at \(3\), as in Figure 5.5 (move the slider to display \(R_6\)). Thus,
\[ \begin{aligned} R_6 & =\dfrac{1}{3} \left( f\left(\dfrac43\right) + f\left(\dfrac53\right) + f\left(\dfrac63\right) + f\left(\dfrac73\right)+ f\left(\dfrac83\right)+ f\left(\dfrac93\right) \right) \\ & =\dfrac{1}{3} \left( \left(\dfrac43\right)^2 + \left(\dfrac53\right)^2 + \left(\dfrac63\right)^2 + \left(\dfrac73\right)^2 + \left(\dfrac83\right)^2 + \left(\dfrac93\right)^2 \right) = \dfrac{271}{27} \approx 10.037 \end{aligned} \]
289
Summation notation is a standard notation for writing sums in compact form. The sum of numbers \(a_m,\ldots,a_n\) (\(m \leq n\)) is denoted
\[ \sum\limits_{j=m}^n a_j = a_m + a_{m+1} + \cdots + a_n \]
The Greek letter \(\Sigma\) (capital sigma) stands for “sum,” and the notation \(\sum\limits_{j=m}^n\) tells us to start the summation at \(j=m\) and end it at \(j=n\). For example,
\[ \sum\limits_{j=1}^5 j^2 = 1^2 + 2^2 + 3^2 + 4^2 + 5^2 = 55 \]
In this summation, the \(j\)th term is \(a_j = j^2\). We refer to \(j^2\) as the general term. The letter \(j\) is called the summation index. It is also referred to as a dummy variable because any other letter can be used instead. For example,
\[ \begin{aligned} \sum\limits_{k=4}^6 (k^2 - 2k) & = \overbrace{\left(4^2 - 2(4)\right)}^{k=4} + \overbrace{\left(5^2 - 2(5)\right)}^{k=5} + \overbrace{\left(6^2 - 2(6)\right)}^{k=6} = 47\\ \sum\limits_{m=7}^9 1 & = 1+1+1 = 3 \quad (\text{because }a_7 = a_8 = a_9 = 1) \end{aligned} \]
The usual commutative, associative, and distributive laws of addition give us the following rules for manipulating summations.
For example,
\[ \sum\limits_{j=3}^5 (j^2 + j) = (3^2+3) + (4^2+4) + (5^2+5) \]
is equal to
\[ \sum\limits_{j=3}^5 j^2 + \sum\limits_{j=3}^5 j = (3^2+4^2+5^2) + (3+4+5) \]
290
Linearity can be used to write a single summation as a sum of several summations. For example,
\[ \begin{aligned} \sum\limits_{k=0}^{100} (7k^2 - 4k + 9) & = \sum\limits_{k=0}^{100} 7k^2 + \sum\limits_{k=0}^{100} (-4k) + \sum\limits_{k=0}^{100} 9\\ & = 7 \sum\limits_{k=0}^{100} k^2 - 4\sum\limits_{k=0}^{100} k + 9\sum\limits_{k=0}^{100} 1 \end{aligned} \]
It is convenient to use summation notation when working with area approximations. For example, \(R_N\) is a sum with general term \(f(a+j\Delta x)\):
\[ R_N = \Delta x \left( f(a+\Delta x) + f(a+2\Delta x) + \cdots + f(a+N\Delta x)\right) \]
The summation extends from \(j=1\) to \(j=N\), so we can write \(R_N\) concisely as \[ \boxed{\bbox[#FAF8ED,5pt]{\displaystyle R_N = \Delta x \sum\limits_{j=1}^N f(a+j\Delta x)}} \]
REMINDER
\[ \Delta x = \frac{b-a}{N} \]
We shall make use of two other rectangular approximations to area: the left-endpoint and the midpoint approximations. Divide \([a,b]\) into \(N\) subintervals as before. In the left-endpoint approximation \(L_N\), the heights of the rectangles are the values of \(f(x)\) at the left endpoints [Figure 5.6, set to "Left-endpoint"]. These left endpoints are
\[ a, a+\Delta x,a+2\Delta x, \ldots, a+(N-1)\Delta x \]
and the sum of the areas of the left-endpoint rectangles is
\[ L_N = \Delta x\left( f(a) + f(a+\Delta x) + f(a+2\Delta x) + \cdots + f(a+(N-1)\Delta x)\right) \]
Note that both \(R_N\) and \(L_N\) have general term \(f(a+j\Delta x)\), but the sum for \(L_N\) runs from \(j=0\) to \(j=N-1\) rather than from \(j=1\) to \(j=N\):
\[ \boxed{\bbox[#FAF8ED,5pt]{\displaystyle L_N = \Delta x \sum\limits_{j=0}^{N-1} f(a+j\Delta x)}} \]
In the midpoint approximation \(M_N\), the heights of the rectangles are the values of \(f(x)\) at the midpoints of the subintervals rather than at the endpoints. As we see in Figure 5.6, set to "Midpoint", the midpoints are
\[ a+\frac12\Delta x, a+\frac32\Delta x, \ldots, a+ \left(N - \frac12\right)\Delta x \]
The sum of the areas of the midpoint rectangles is
\[ M_N = \Delta x \left( f\left(a+\frac12\Delta x\right) + f\left(a+\frac32\Delta x\right) + \cdots + f\left(a+ \left(N - \frac12\right)\Delta x\right)\right) \]
In summation notation,
\[ \boxed{\bbox[#FAF8ED,5pt] {\displaystyle M_N = \Delta x \sum\limits_{j=1}^N f(a+(j - \frac12)\Delta x)} } \]
291
Calculate \(R_6\), \(L_6\), and \(M_6\) for \(f(x)= x^{-1}\) on \([2, 4]\).
Solution In this case, \(\Delta x = \tfrac{b-a}N = \tfrac{4-2}6 = \tfrac13\). The general term in the summation for \(R_6\) and \(L_6\) is
\[ f(a+j\Delta x) = f\left( 2+\frac j3\right) = \frac1{2+\tfrac j3} = \frac3{6+j} \]
Therefore (Figure 5.7),
\[ \begin{aligned} R_6 & = \frac13 \sum\limits_{j=1}^6 f\left( 2+\frac j3\right) = \frac13 \sum\limits_{j=1}^6\frac3{6+j} \\ & = \frac13\left(\frac37 +\frac38+\frac39+\frac3{10}+\frac3{11}+\frac3{12}\right)\approx 0.653 \end{aligned} \]
In \(L_6\), the sum begins at \(j=0\) and ends at \(j=5\):
\[ L_6 = \frac13 \sum\limits_{j=0}^5\frac3{6+j} = \frac13\left(\frac36+\frac37 +\frac38+\frac39+\frac3{10}+\frac3{11}\right) \approx 0.737 \]
The general term in \(M_6\) is
\[ f\left(a+\left(j - \frac12\right)\Delta x\right) = f\left(2+\left(j - \frac12\right)\frac13\right) = \frac1{2+\frac j3 - \frac16} = \frac6{12+2j -1} \]
Summing up from \(j=1\) to \(j=6\), we obtain (Figure 5.8)
\[ \begin{aligned} M_6 & = \frac13 \sum\limits_{j=1}^6 f\left(2+\left(j - \frac12\right)\frac13\right) = \frac13 \sum\limits_{j=1}^6\frac6{12+2j -1} \\ & = \frac13 \left(\frac6{13} + \frac6{15} + \frac6{17} + \frac6{19} + \frac6{21} + \frac6{23} \right) \approx 0.692 \end{aligned} \]
Observe in Figure 5.7 that the left-endpoint rectangles for \(f(x)= x^{-1}\) extend above the graph and the right-endpoint rectangles lie below it. The exact area \(A\) must lie between \(R_6\) and \(L_6\), and so, according to the previous example, \(0.65\leq A\leq 0.74\). More generally, when \(f(x)\) is monotonic (increasing or decreasing), the exact area lies between \(R_N\) and \(L_N\) (Figure 5.9):
292
Figure 5.10 shows several right-endpoint approximations. Notice that the error, corresponding to the yellow region above the graph, gets smaller as the number of rectangles increases. In fact, it appears that we can make the error as small as we please by taking the number \(N\) of rectangles large enough. If so, it makes sense to consider the limit as \(N\rightarrow\infty\), as this should give us the exact area under the curve. The next theorem guarantees that the limit exists.
In Theorem 1, it is not assumed that \(f(x)\leq 0\). If \(f(x)\) takes on negative values, the limit \(L\) no longer represents area under the graph, but we can interpret it as a sum and difference of areas, which is discussed in the next section.
If \(f(x)\) is continuous on \([a,b]\), then the endpoint and midpoint approximations approach one and the same limit as \(N\rightarrow\infty\). In other words, there is a value \(L\) such that
\[ \lim\limits_{ N\rightarrow\infty } R_N= \lim\limits_{ N\rightarrow\infty}L_N = \lim\limits_{ N\rightarrow \infty} M_N = L \]
If \(f(x)\geq 0\), we define the area under the graph over \([a,b]\) to be \(L\).
In calculus, limits are used to define basic quantities that otherwise would not have a precise meaning. Theorem 1 allows us to define area as a limit \(L\) in much the same way that we define the slope of a tangent line as the limit of slopes of secant lines.
The next three examples illustrate Theorem 1 using formulas for power sums. The \(k\)th power sum is the sum of the \(k\)th powers of the first \(N\) integers. We shall use the power sum formulas for \(k = 1, 2, 3\).
A method for proving power sum formulas is developed in Exercises 40–43 of Section 1.3. Formulas (3)–(5) can also be verified using the method of induction.
\begin{align} \sum\limits_{j=1}^N j\phantom{^2} & = 1\phantom{^2} + 2\phantom{^2} + \cdots + N\phantom{^2} = \frac{N(N+1)}2 = \frac{N^2}2 + \frac N2 \tag{3}\\ \sum\limits_{j=1}^N j^2 & = 1^2 + 2^2 + \cdots + N^2 = \frac{N(N+1)(2N+1)}6 = \frac{N^3}3 + \frac{N^2}2 + \frac N6 \tag{4}\\ \sum\limits_{j=1}^N j^3 &= 1^3 + 2^3 + \cdots + N^3 = \frac{N^2(N+1)^2}4 = \frac{N^4}4 + \frac{N^3}2 + \frac{N^2}4 \tag{5} \end{align}
For example, by Eq. (4),
\[ \sum\limits_{j=1}^6 j^2 = 1^2 + 2^2+ 3^2+ 4^2+ 5^2+ 6^2 = \underbrace{\frac{6^3}3 + \frac{6^2}2 + \frac66}_{\textstyle\frac{N^3}3 + \frac{N^2}2 + \frac N6\text{ for }N=6} =91\]
293
As a first illustration, we compute the area of a right triangle “the hard way.”
Find the area \(A\) under the graph of \(f(x)=x\) over \([0, 4]\) in three ways:
Solution The region under the graph is a right triangle with base \(b=4\) and height \(h = 4\) (Figure 5.11).
REMINDER \[ \begin{aligned} R_N & = \Delta \sum\limits_{j=1}^N f(a+j\Delta x) \\ L_N & = \Delta \sum\limits_{j=0}^{N-1} f(a+j\Delta x) \\ \Delta c & = \frac{b-a}N \end{aligned} \]
In Eq. (6), we apply the formula \[ \sum\limits_{j=1}^N j = \frac{N(N+1)}2 \] with \(N-1\) in place of \(N\): \[ \sum\limits_{j=1}^{N-1} j = \frac{(N-1)N}2 \]
In the next example, we compute the area under a curved graph. Unlike the previous example, it is not possible to compute this area directly using geometry.
294
Let \(A\) be the area under the graph of \(f(x) = 2x^2 - x + 3\) over \([2, 4]\) (Figure 5.12). Compute \(A\) as the limit \(\lim\limits_{N \rightarrow\infty } R_N\).
Solution
Step 1. Express \(R_N\) in terms of power sums.
In this case, \(\Delta x = \tfrac{4-2}N = \tfrac2N\) and \[ R_N = \Delta x \sum\limits_{j=1}^N f(a+j\Delta x) = \frac2N \sum\limits_{j=1}^N f\left(2+\frac{2j}N\right) \] Let’s use algebra to simplify the general term. Since \(f(x)=2x^2-x+3\), \[ \begin{aligned} f\left(2+\frac{2j}N\right) & = 2 \left(2+\frac{2j}N\right)^2 - \left(2+\frac{2j}N\right) + 3\\ & = 2\left(4+\frac{8j}N + \frac{4j^2}{N^2}\right) - \left(2+\frac{2j}N\right) + 3 \\ & = \frac8{N^2} j^2 + \frac{14}N j + 9 \end{aligned} \] Now we can express \(R_N\) in terms of power sums: \[ \begin{align} R_N & = \frac2N \sum\limits_{j=1}^N \left(\frac8{N^2} j^2 + \frac{14}N j + 9\right) \\ & = \frac2N \sum\limits_{j=1}^N\frac8{N^2} j^2 + \frac2N\sum\limits_{j=1}^N\frac{14}N j + \frac2N\sum\limits_{j=1}^N9\\ & = \frac{16}{N^3}\sum\limits_{j=1}^N j^2 + \frac{28}{N^2}\sum\limits_{j=1}^N j + \frac{18}{N}\sum\limits_{j=1}^N 1 \tag{7} \end{align} \]
Step 2. Use the formulas for the power sums.
Using formulas (3) and (4) for the power sums in Eq. (7), we obtain \[ \begin{aligned} R_N & =\frac{16}{N^3} \left(\frac{N^3}3 + \frac{N^2}2 + \frac N6 \right) + \frac{28}{N^2} \left( \frac{N^2}2 + \frac N2\right) + \frac{18}N (N)\\ & = \left(\frac{16}3 + \frac8N + \frac8{3N^2} \right) + \left( 14 + \frac{14}N\right) + 18 \\ & = \frac{112}3 + \frac{22}N + \frac8{3N^2} \end{aligned} \]
Step 3. Calculate the limit.
\[ A = \lim\limits_{N \rightarrow\infty } \left(\frac{112}3 + \frac{22}N + \frac8{3N^2}\right) = \frac{112}3. \]
REMINDERBy Eq. (4) \[ \sum\limits_{j=1}^N j^2 = \frac{N^3}3 + \frac{N^2}2 + \frac N6 \]
Prove that for all \(b > 0\), the area \(A\) under the graph of \(f(x) = x^2\) over \([0, b]\) is equal to \(\tfrac{b^3}3\), as indicated in Figure 5.13.
Solution We’ll compute with \(R_N\). We have \(\Delta x = \tfrac{b-0}{N} = \tfrac{b}{N}\) and \[ R_N = \Delta x \sum\limits_{j=1}^N f(0 + j\Delta x) = \frac bN \sum\limits_{j=1}^N \left(0+j\frac b N\right)^2 = \frac bN \sum\limits_{j=1}^N \left(j^2\frac{b^2}{N^2}\right) = \frac{b^3}{N^3}\sum\limits_{j=1}^N j^2 \] By the formula for the power sum recalled in the margin, \[ \begin{aligned} R_N & = \frac{b^3}{N^3}\left(\frac{N^3}3 + \frac{N^2}2 + \frac N6 \right) = \frac{b^3}3 + \frac{b^3}{2N} + \frac{b^3}{6N^2} \\ &=\lim\limits_{N \rightarrow \infty } \left(\frac{b^3}3 + \frac{b^3}{2N} + \frac{b^3}{6N^2} \right) = \frac{b^3}3 \end{aligned} \]
295
The area under the graph of any polynomial can be calculated using power sum formulas as in the examples above. For other functions, the limit defining the area may be hard or impossible to evaluate directly. Consider \(f(x) = \sin x\) on the interval \([\tfrac\pi4,\frac{3\pi}4]\). In this case (Figure 5.14), \(\Delta x = \tfrac{3\pi/4 - \pi/4}{N} = \tfrac\pi{2N}\) and the area \(A\) is
\[ A = \lim\limits_{N \rightarrow \infty} R_N = \lim\limits_{N \rightarrow \infty} \Delta x \sum\limits_{j=1}^N f(a+j\Delta x) = \lim\limits_{N \rightarrow \infty}\frac\pi{2N} \sum\limits_{j=1}^N \sin\left(\frac\pi4+\frac{\pi j}{2N}\right) \] With some work, we can show that the limit is equal to \(A=\sqrt{2}\). However, in Section 5.3 we will see that it is much easier to apply the Fundamental Theorem of Calculus, which reduces area computations to the problem of finding antiderivatives.
We used the formulas for the \(k\)th power sums for \(k = 1, 2, 3\). Do similar formulas exist for all powers \(k\)? This problem was studied in the seventeenth century and eventually solved around 1690 by the great Swiss mathematician Jacob Bernoulli. Of this discovery, he wrote
"With the help of [these formulas] it took me less than half of a quarter of an hour to find that the 10th powers of the first 1000 numbers being added together will yield the sum
91409924241424243424241924242500"
Bernoulli’s formula has the general form \[ \sum\limits_{j=1}^n j^k = \frac1{k+1}n^{k+1} + \frac12 n^k + \frac k{12}n^{k-1} + \cdots \] The dots indicate terms involving smaller powers of n whose coefficients are expressed in terms of the so-called Bernoulli numbers. For example, \[ \sum\limits_{j=1}^n j^4 = \frac15 n^5 + \frac12n^4 + \frac13n^3 - \frac1{30}n \] These formulas are available on most computer algebra systems.
Power Sums \[ \begin{aligned} \sum\limits_{j=1}^N j\phantom{^2} & = \frac{N(N+1)}2 = \frac{N^2}2 + \frac N2 \\ \sum\limits_{j=1}^N j^2 & = \frac{N(N+1)(2N+1)}6 \\ & = \frac{N^3}3 + \frac{N^2}2 + \frac N6 \\ \sum\limits_{j=1}^N j^3 & = \frac{N^2(N+1)^2}4 = \frac{N^4}4 + \frac{N^3}2 + \frac{N^2}4 \end{aligned} \]